Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming

نویسندگان

  • Robert Aichner
  • Shoko Araki
  • Shoji Makino
  • Tsuyoki Nishikawa
  • Hiroshi Saruwatari
چکیده

We propose a time-domain BSS algorithm that utilizes geometric information such as sensor positions and assumed locations of sources. The algorithm tackles the problem of convolved mixtures by explicitly exploiting the non-stationarity of the acoustic sources. The learning rule is based on secondorder statistics and is derived by natural gradient minimization. The proposed initialization of the algorithm is based on the null beamforming principle. This method leads to improved separation performance, and the algorithm is able to estimate long unmixing FIR filters in the time domain due to the geometric initialization. We also propose a post-filtering method for dewhitening which is based on the scaling technique in frequency-domain BSS. The validity of the proposed method is shown by computer simulations. Our experimental results confirm that the algorithm is capable of separating real-world speech mixtures and can be applied to short learning data sets down to a few seconds. Our results also confirm that the proposed dewhitening post-filtering method maintains the spectral content of the original speech in the separated output. INTRODUCTION Blind source separation (BSS) refers to the problem of recovering signals from several observed linear mixtures. The adjective “blind” stresses the fact that the source signals are not observed and that no information on the mixing process is available. The lack of a priori knowledge of the mixing system is compensated by a statistically strong but physically plausible assumption of independence. The weakness of the prior information is precisely the strength of the BSS model. Thus BSS has received considerable attention in the last few years, and many algorithms have been proposed [4, 5, 6, 10]. However, the separation of broadband signals in reverberant environments remains a challenging problem. This work was performed while the author was with the University of Applied Sciences Regensburg. Currently the author is at the Telecommunications Laboratory, University of Erlangen-Nuremberg. In this paper we consider the BSS of convolutive mixtures of speech. Many researchers have proposed frequency domain algorithms, which in general transform the convolutive mixture problem in the time domain to multiple instantaneous mixtures in the frequency domain. However, in [3] it was shown that for long reverberation the separation performance of frequency domain BSS degrades for long FFT frame sizes. This was our motivation in approaching the convolutive BSS problem in the time domain. CONVOLUTIVE BSS MODEL In real environments, where sound travels slowly compared to the distances in a typical acoustic environment, the signal arrives at the sensors with different time delays. This scenario is referred to as a multi-path environment and can be described as a finite impulse response (FIR) convolutive mixture:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

First International Workshop on Independent Component Analysis and Signal Separation (ICA'99) REAL WORLD BLIND SEPARATION OF CONVOLVED NON-STATIONARY SIGNALS

In this paper, a method of blind separation for convolved non-stationary signals (e.g., speech signals and music) is presented. Our method achieves blind separation by forcing mixed signals to uncorrelate with each other. The validity of the proposed method has been con rmed by a computer simulation and an experiment in an anechoic room [7]. In this paper, we apply our method to an experiment w...

متن کامل

Blind Separation of Jointly Stationary Correlated Sources

The separation of unobserved sources from mixed observed data is a fundamental signal processing problem. Most of the proposed techniques for solving this problem rely on independence or at least uncorrelation assumption for source signals. This paper introduces a technique for cases that source signals are correlated with each other. The method uses Wold decomposition principle for extracting ...

متن کامل

Blind source separation of convolved sources by joint approximate diagonalization of cross-spectral density matrices

In this paper we present a new method for separating non-stationary sources from their convolutive mixtures based on approximate joint diagonalizing of the observed signals’ cross-spectral density matrices. Several blind source separation (BSS) algorithms have been proposed which use approximate joint diagonalization of a set of scalar matrices to estimate the instantaneous mixing matrix. We ex...

متن کامل

The Generalized Sidelobe Decorrelator

We introduce a new hybrid algorithm called the generalized sidelobe decorrelator (GSD) that combines elements of geometric beamforming and blind source separation. On the one hand, it is an extension of the generalized sidelobe canceller (GSC), also known as the Griffiths-Jim beamformer, from the standard criteria of power minimization to a decorrelation criteria. On the other hand, it can be s...

متن کامل

STFT based Blind Separation of Underdetermined Speech Mixtures

Analysis of non stationary signals like audio, speech and biomedical signals require good resolution both in time and frequency as their spectral components are not fixed. There are many applications of time-frequency analysis in non stationary signals like source separation, signal denoising etc. This paper presents an application of time frequency analysis using STFT, Short Time Fourier Trans...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002